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Robust recovery of complex exponential signals from random 
Gaussian projections via low rank Hankel matrix reconstruction 

Jian-Feng Cai* Xiaobo Qu* Weiyu Xu* Gui-Bo Ye* 


Abstract 

This paper explores robust recovery of a superposition of R distinct complex exponential functions 
from a few random Gaussian projections. We assume that the signal of interest is of 2 N — 1 dimensional 
and R « 2 N — 1. This framework covers a large class of signals arising from real applications in biology, 
automation, imaging science, etc. To reconstruct such a signal, our algorithm is to seek a low-rank Hankel 
matrix of the signal by minimizing its nuclear norm subject to the consistency on the sampled data. Our 
theoretical results show that a robust recovery is possible as long as the number of projections exceeds 
0(R In 2 N). No incoherence or separation condition is required in our proof. Our method can be applied 
to spectral compressed sensing where the signal of interest is a superposition of R complex sinusoids. 
Compared to existing results, our result here does not need any separation condition on the frequencies, 
while achieving better or comparable bounds on the number of measurements. Furthermore, our method 
provides theoretical guidance on how many samples are required in the state-of-the-art non-uniform 
sampling in NMR spectroscopy. The performance of our algorithm is further demonstrated by numerical 
experiments. 


1 Introduction 

Many practical problems involve signals that can be modeled or approximated by a superposition of a few 
complex exponential functions. In particular, if we choose the exponential function to be complex sinusoid, 
it covers signals in acceleration of medical imaging [15], analog-to-digital conversion [20], inverse scattering 
in seismic imaging [1], etc. Time domain signals in nuclear magnetic resonance (NMR) spectroscopy, that 
are widely used to analyze the compounds in chemistry and protein structures in biology, are another type 
of signals that can be modeled or approximated by a superposition of complex exponential functions [16]. 
How to recover those superposition of complex exponential functions is of primary importance in those 
applications. 

In this paper, we will consider how to recover those complex exponentials from linear measurements of 
their superposition. More specifically, let x £ C 2 ^' 1 be a vector satisfying 

R 

= 3=0, l,...,2N-2, (1) 

k= 1 

where Zk £ C, k = 1 ,R, are some unknown complex numbers. In other words, x is a superposition of 
R exponential functions. We assume R <C 2 N — 1. When \zk\ = 1, k = 1,...,R, x is a superposition of 
complex sinusoids. When Zk = e~ Tk e‘ 2m ^ k , k = 1 ,,R, x models the signal in NMR spectroscopy. 
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Since R -C 2N — 1, the degree of freedom to determine x is much less than the ambient dimension 2N—1. 
Therefore, it is possible to recover x from its under-sampling [3,5,8,12], In particular, we consider to recover 
x from its linear measurement 

b = Ax, (2) 

where A £ C Mx ( 2JV_1 ) with M -C 2TV — 1. 

We will use a Hankel structure to reconstruct the signal of interest x. The Hankel structure originates 
from the matrix pencil method [14] for harmonic retrieval for complex sinusoid. The conventional matrix 

pencil method assumes fully observed x as well as the model order R, which are both unknown here. 

Following the ideas of the matrix pencil method in [14] and enhanced matrix completion (EMaC) in [10], we 
construct a Hankel matrix based on signal x. More specifically, define the Hankel matrix H £ C NxN by 

Hj k =x j+k , j, k = 0,1. (3) 

Throughout this paper, indices of all vectors and matrices start from 0, instead of 1 in conventional notations. 
It can be shown that H is a matrix with rank R. Instead of reconstructing x directly, we reconstruct the 
rank-i? Hankel matrix H , subject to the constraint that (2) is satisfied. 

Low rank matrix recovery has been widely studied [2,5,6,17]. It is well known that minimizing the nuclear 
norm tends to lead to a solution of low-rank matrices. Therefore, a nuclear norm minimization problem 
subject to the constraint (2) is proposed. More specifically, for any given x £ C 2N ~ 1 , let H(x) £ C NxN be 
the Hankel matrix whose first row and last column is x, i.e., [H(x)\j k = Xj+k- We propose to solve 

min ||iT(a:)||* ) subject to Ax = b, (4) 

X 

where || • ||* is the nuclear norm function (the sum of all singular values), and A and b are from the linear 
measurement (2). When there is noise contained in the observation, i.e., 

b = Ax + rj, 


we solve 

min ||iT(®)||*, subject to \\Ax — fe|| 2 < 5, (5) 

X 

where S = ||j 7||2 is the noise level. 

An important theoretical question is how many measurements are required to get a robust reconstruction 
of H via (4) or (5). For a generic unstructured N x TV matrix of rank R, standard theory [6,7,9,17] 
indicates that 0(NR ■ poly(log N)) measurements are needed for a robust reconstruction by nuclear norm 
minimization. This result, however, is unacceptable here since the number of parameters of H is only 2iV— 1. 
The main contribution of this paper is then to prove that (4) and (5) give a robust recovery of H (hence 
x) as soon as the number of projections exceeds 0(R In 2 N) if we choose the linear operator A to be some 
scaled random Gaussian projections. This result is further extended to the robust reconstruction of low-rank 
Hankel or Toeplitz matrices from its few Gaussian random projections. 

Our result can be applied to various signals of superposition of complex exponentials, including, but 
not limited to, signals of complex sinusoids and signals in accelerated NMR spectroscopy. When applied to 
complex sinusoids, our result here does not need any separation condition on the frequencies, while achieving 
better or comparable bounds on the number of required measurements. Furthermore, our theoretical result 
provides some guidance on how many samples to choose for the model proposed in [16] to recover NMR 
spectroscopy. 

• Complex sinusoids. When \z k \ = 1 for k = 1,.. ., R, we must have z k = e 27 ”A for some frequency /&. 
In this case, a; is a superposition of complex sinusoids, for examples, in the analog-to-digital conversion 
of radio signals [20]. The problem on recovering x from its as few as possible linear measurements (2) 
may be solved using compressed sensing (CS) [8]. One can discretize the domain of frequencies f k by 
a uniform grid. When the frequencies fk indeed fall on the grid, x is sparse in the discrete Fourier 
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transform domain, and CS theory [8,12] suggests that it is possible to reconstruct x from its very few 
samples via £-]-norm minimization, provided that R <C 2 N — 1. Nevertheless, the frequencies fk in our 
setting usually do not exactly fall on a grid. The basis mismatch between the true parameters and the 
grid based on discretization degenerates the performance of conventional compressed sensing [11]. 

To overcome this, [4,19] proposed to recover off-the-grid complex sinusoid frequencies using total varia¬ 
tion minimization or atomic norm [9] minimization. They proved that the total variation minimization 
or atomic norm minimization can have a robust reconstruction of x from a nonuniform sampling of 
very few entries of x , provided that the frequencies fk , k = 1,..., R, has a good separation. Another 
method for recovering off-the-grid frequencies is enhanced matrix completion (EMaC) proposed by 
Chen et al [10], where the Hankel structure plays a central role similar to our model. The main result 
in [10] is that the complex sinusoids x can be robustly reconstructed via EMaC from its very few 
nonuniformly sampled entries. Again, the EMaC requires a separation of the frequencies, described 
implicitly by an incoherence condition. 

When applied to complex sinusoids, compared to the aforementioned existing results, our result here 
does not need any separation condition on the frequencies, while achieving better or comparable bound 
of number of measurements. 

• Accelerated NMR spectroscopy. When Zk = e~ Tk e 27Tt ^ k , k = 1 x models the signal 

in NMR spectroscopy, which arises frequently in studying short-lived molecular systems, monitoring 
chemical reactions in real-time, high-throughput applications, etc. Recently, Qu et al [16] proposed an 
algorithm based on low rank Hankel matrix. In this specific application, A is a matrix that denotes 
the under-sampling of NMR signals in the time domain. Numerical results show its efficiency in [16] 
while theoretical results are still needed to explain. It is vital to give some theoretical results on this 
model since it will give us some guidance on how many samples should be chosen to guarantee the 
robust recovery. Though the result in [10] applies to this problem, it needs an incoherence condition, 
which remains uncertain for diverse chemical and biology samples. Our result in this paper does not 
require any incoherence condition. Moreover, our bound is better than that in [10]. 

The rest of this paper is organized as follows. We begin with our model and our main results in Section 
2. Proofs for the main result are given in Section 3. Then, in Section 4, we extend the main result to the 
reconstruction of generic low-rank Hankel or Toeplitz matrices. Finally, the performance of our algorithm is 
demonstrated by numerical experiments in Section 5. 


2 Model and Main Results 


Our approach is based on the observation that the Hankel matrix whose first row and last column consist of 
entries of x has rank R. Let H be the Hankel matrix defined by (3). Eq. (1) leads to a decomposition 


H = 

1 

Zl 

1 

ZR 
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Zl . . 

1 

1 


1 

is) 

.. 

1 

So ^ 

1 

1_ 



CR 


1 

zr ■ ■ 

Z R~ l \ 


Therefore, the rank of H is R. Similar to Enhanced Matrix Completion (EMaC) in [10], in order to 
reconstruct x , we first reconstruct the rank-I? Hankel matrix H, subject to the constraint that (2) is 
satisfied. Then, x is derived directly by choosing the first row and last column of H . More specifically, for 
any given x £ C 2Ar_1 , let H(x) £ C Wxiv be the Hankel matrix whose first row and last column is x 1 i.e., 
[H(x)\jk = Xj+k ■ We propose to solve 

min rank(iT(a:)), subject to Ax = b , (6) 

X 


3 








where rank(iT(a:)) denotes the rank of H(x), and A and b are from the linear measurement (2). When 
there is noise contained in the observation, i.e, b = Ax + 77 , we correspondingly solve 

minrank(iT(a;)), subject to ||^4sc — b\\ 2 < 6, (7) 

X 

where 5 = || 77 H 2 is the noise level. 

These two problems are all NP hard problems and not easy to solve. Following the ideas of matrix 
completion and low rank matrix recovery [6, 7,9,17], it is possible to exactly recover the low rank Hankel 
matrix via nuclear norm minimization. Therefore, it is reasonable to use nuclear norm minimization for our 
problem and it leads to the models in (4) and (5). 

Intuitively, our model is reasonable and likely to work. Theoretical results are desirable to guarantee 
it. The results in [6, 7, 9,17] do not consider the Hankel structure. For generic N x N rank-1? matrix, 
they requires 0(NR ■ poly (log N)) measurements for robust recovery which is too much since there are only 
2N — 1 degrees of freedom in H{x). The theorems proposed in [19] work only for a special case where signals 
of interest are superpositions of complex sinusoids, which excludes, e.g., the signals in NMR spectroscopy. 
While the results from [10] extend to complex exponentials, the performance guarantees in [4,10,19] require 
incoherence conditions, implying the knowledge of frequency interval in spectroscopy, which are not available 
before the realistic sampling of diverse chemical or biological samples. This limits the applicability of these 
theories. 

It is challenging to provide a theorem guaranteeing the exact recovery for model (4) with arbitrarily 
linear measurements A. In this paper, we provide a theoretical result ensuring exact recovery when A is 
a scaled random Gaussian matrix. Our result does not assume any incoherence conditions on the original 
signal. 

Theorem 1. Let A = BV £ qMx( 2 N-i), w h ere ft g C Mx ( 2 A r -i) ls a ranc [ orn matrix whose real and 
imaginary parts are i.i.d. Gaussian with mean 0 and variance 1, T> £ ]gOA r -i)x( 2 ./v-i) a diagonal matrix 
with the j-th diagonal y/j + 1 if j < N — 1 and y/2N — 1 — j otherwise. Then, there exists a universal 
constant C\ > 0 such that, for an arbitrary e > 0, If 

M > (CiVUlnN + V2e) 2 + 1, 

then, with probability at least 1 — 2e ~, we have 

(a) x = x, where x is the unique solution of (4) with b = Ax; 

(b) ||77>(A — x )\\2 < 2 5/e, where x is the unique solution of (5) with ||6 — * 4 i :||2 < S. 

The number of measurements required is 0(R In 2 N), which is reasonable small compared with the number 
of parameters in H(x). Furthermore, there is a parameter e in Theorem 1. For the noise-free case (a), the 
best choice of e is obviously a number that is very close to 0. For the noisy case (b), we can balance the error 
bound and the number of measurements to get an optimal e. On the one hand, according to the result in (b), 
in order to make the error in noisy case as small as possible, we would like e to be as large as possible. On 
the other hand, we would like to keep the measurements M of the order of l?ln 2 N. Therefore, a seemingly 
optimal choice of e is e = 0(R In 2 N). With this choice of e, the number of measurements M = 0(R In 2 N ) 

and the error \[D(x — x )\\2 < O (^ 7 ) ■ 

3 Proof of Theorem 1 

In this section, we prove the main result Theorem 1. 
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3.1 Orthonormal Basis of the TV x TV Hankel Matrices Subspace 

In this subsection, we introduce an orthonormal basis of the subspace of TV x TV Hankel matrices and use it 
to define a projection from C NxN to the subspace of all TV x TV Hankel matrices. 

Let Ej £ C NxN , j = 0,1, 2TV — 2, be the Hankel matrix satisfying 


V i if k + l=j, 

0 , 


otherwise, 


k, l = 0,..., TV — 1, 


( 8 ) 


where Kj = j + 1 for j < TV — 1 and Kj = 2TV — 1 — j for j > TV — 1 is the number of non-zeros in Ej. Then, 
it is easy to check that {Ej}™^ 2 forms an orthonormal basis of the subspace of all TV x TV Hankel matrices, 
under the standard inner product in <C NxN . 

Define a linear operator 


s 


2N-2 


X £ C 2JV_1 ^ g x = x j E j e C NxN 

j =0 


The adjoint Q* of Q is 


g* : x £ c NxN ^ g*x £ c 2JV_1 , 


\g*x]j = {x,Ej). 


(9) 


Obviously, g*g is the identity operator in C 2N 1 , and gg* is the orthogonal projector onto the subspace of 
all Hankel matrices. 


3.2 Recovery condition based on restricted minimum gain condition 

First of all, let us simplify the minimization problem (4) by introducing V £ c( 2Ar - 1 ) x ( 2W ^ 1 ) ; the diagonal 
matrix with j-th diagonal y/Kj . Then, by letting y = T>x , (4) is rewritten as, 

min \\gy\\* subject to By = b 1 ( 10 ) 

v 

where B = AV~ 1 . Similarly, for the noisy case, (5) is rearranged to 

min||t?y||* subject to \\By - b\\ 2 < e. (11) 

v 

By our assumption in Theorem 1, B £ <£ Mx (. 2N - 1 ) i s a random matrix whose real and imaginary parts are 
both real-valued random matrices with i.i.d. Gaussian entries of mean 0 and variance 1. We will prove 
y = Vx (respectively \\y — y H 2 < 2 S/e) with dominant probability for problem (10) for the noise free case 
(respectively ( 11 ) for the noisy case). 

Let the desent cone of \\g ■ ||* at y be 


£(») = (A* | A > 0, || g(y + z)||. < \\gy\\*}- (12) 

To characterize the recovery condition, we need to use the minimum value of for nonzero z £ T(y). 

This quantity is commonly called the minimum gain of the measurement operator B restricted on 'T(y) [9]. In 
particular, if the minimum gain is bounded away from zero, then the exact recovery (respectively approximate 
recovery) for problem ( 10 ) (respectively ( 11 )) holds. 

Lemma 1 . Let T(y) be defined by (12). Assume 


■ \\Bz\\ 3 

mm 

zex(y) ||z || 2 


> e. 


(13) 
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(a) Let y be the solution of (10) with b = By. Then y = y. 

(b) Let y be the solution of (11) with ||6 —jBt /|| 2 < 6. Then ||y — y || 2 < 25/e. 

Proof. Since (a) is a special case of (b) with 5 = 0, we prove (b) only. The optimality of y implies y—y £ 1(y). 
By (13), we have 

II y - vh < -II B{y - y) || 2 < -(||By - b|| 2 + ||By - b|| 2 ) < 2 5/e. 
e e 

□ 

Minimum gain condition is a powerful concept and has been employed in recent recovery results via l\ 
norm minimization, block-sparse vector recovery, low-rank matrix reconstruction and other atomic norms [9]. 


3.3 Bound of minimum gain via Gaussian width 

Lemma 1 requires to estimate the lower bound of ^ z \\/ ■ Gordon gave a solution using Gaussian 

width of a set [9,13] to estimate the lower bound of minimum gain. 

Definition 1. The Gaussian width of a set S C R p is defined as: 


w(S) := E C 


sup 7 T £ 
."/GS 


where £ G is a random vector of independent zero-mean unit-variance Gaussians. 

Let X n denote the expected length of a n-dimensional Gaussian random vector. Then A„ = v / 2T( Ii yi)/r(^) 
and it can be tightly bounded as ^, ” +1 < X n < yjn [9]. The following theorem is given in Corollary 1.2 
in [13]. It gives a bound on minimum gain for a random map II : i —> R". 

Theorem 2 (Corollary 1.2 in [13]). Let Q be a closed subset of {x £ R p |||£c || 2 = 1}. Let II £ R nxp be a 
random matrix with i.i.d. Gaussian entries with mean 0 and variance 1. Then, for any e > 0, 


min lind 
VzeQ 


2 > e > 1 — e 


-i(A„— w(Q)-e ) 2 


provided X n — w(Ll) — e > 0. Here ^ < A„ < ijn, and w(Ll) is the Gaussian width ofQ. 

By converting the complex setting in our problem to the real setting and using Theorem 2, we can get 
the bound of (13) in terms of Gaussian width of T k (j/) n where Tr(?/) is a cone in R 4Ar ~ 2 defined by 


£r( 2 /) 


OL 

(3 


a + i/3 £ T (y) \. 


(14) 


Lemma 2. Let the real and imaginary parts of entries of B £ C Mx ( 2N b be i.i.d. Gaussian with mean 0 
and variance 1. Let Tr (y) be defined by (14) and S 2JV ” 2 be the unit sphere in C 2N ~ 1 . Then for any e > 0, 


P 


( min 

z &%(y)nsl N ~ 2 


\\Bz\\ 2 > 




where 3 is the unit sphere in R 4Ar 2 . 

Proof. In order to use Theorem 2, we convert the complex setting in our problem to the real setting in 
Theorem 2. We will use Roman letters for vectors and matrices in complex-valued spaces, and Greek letters 
for real valued ones. Let B = & + i\ 1/ £ C Mx ( 2Ar - 1 ) ; where both $ £ R Mx ( 27V ~ 4 ) and U/ £ R Mx ( 2Ar ~ 4 ) 
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are real-valued random matrices whose entries are i.i.d. mean-0 variance-1 Gaussian. Then, for any z = 
a + if3 € C 2N ~ X with a, f3 € , 


\Bz\\ 2 = ||(# + *¥)(a + i/3)h = ||(*a - #/3) + + #/3 )|| 2 


[# -#] 


# # 


1/2 


Then 

min 

z=a.+ipe‘Z(y)n§, 2 c N - 2 

implies 

Therefore, 


[# -#] 


> e/y/2, and min 

z=a+z/3e‘r(t/)ns? iv “ 2 


min ||Bz || 2 > e. 

z^(y)ns 2N ~ 2 


min ||Sz ||2 > e ] > P ((15) holds true). 


[# #] 


> e/V2 (15) 


\zex(i/)ns^ 

It is easy to see that both [# — #] and [# #] are real-valued random matrices with i.i.d. Gaussian 

entries of mean 0 and variance 1. By Theorem 2, 


P ((15) holds true) > 1 — 2e 2 


(A M -'u;(‘i: E (y)nSg JV_3 )-^j) 2 


and therefore we get the desired result. 


□ 


3.4 Estimation of Gaussian width w(Tr (y) fl §g A? 3 ) 

Denote Tjjj(y)) be polar cone of Tr(i/)) £ R 4W " 2 , i.e., 

£«(y) = {<5 G R 4Ar_2 1 1 T 8 < 0, v 7 G Tr(j/)}. (16) 

Following the arguments in Proposition 3.6 in [9], we obtain 


w(TR(y) nSR W 3 ) = E I sup £ r 7|<E( mm |||- 7|| 2 ) , 


VTGlElylnSg 




(17) 


where | £ R iN ~ 2 is a random vector of i.i.d. Gaussian entries of mean 0 and variance 1. Hence, instead 
of estimating Gaussian width iy(X®(y) fl §r W_ 3 ), we bound E ^min Te ^j(y) ||| — 7 H 2 ) ■ For this purpose, let 
jr . ]^ 4 JV -2 be defined by 


T 


fl 


= \\g(a + i 0 )\\.. 


(18) 


The following lemma gives us a characterization of E ^min 7 £ .j»(£) ||| — 7 H 2 J in terms of the subdifferential 
dT of T. 

Lemma 3. Let Tj^(y) and T be defined by (16) and (18) respectively. Let u>i,u >2 £ R 2Ar_1 be the real and 

w 1 


imaginary parts of y respectively and denote Cj = 


oj 2 


. Then 


Tg(y) = cone ( dJ- (w)) = {A5 | A > 0, T (7 + us) > T (a>) + 7 T S, V 7 £ R 4JV 2 } . 


(19) 
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Proof. It is observed that Tr(t/) in (14) is the descent cone of the function T 

^R(y) = {^7 I S > 0, T (7 + Cj) < T (w)} . 

According to Theorem 23.4 in [18], the cone dual to the descent cone is the conic hull of subgradient, which 
is exactly (19). □ 

The following lemma gives us an estimation of Gaussian width w( Tr(i/) D Sj^ -3 ) in terms of E(||^g|| 2 ). 

Lemma 4. Let Tr (y) and Q be defined by (14) and (9) respectively. Then 

wpPv) n S^" 3 ) < 3VR ■ E(\\Ggh), 

where E(||C?gr|| 2 ) is the expectation with respect to g G C 2N ~ 1 . Here g is a random vector whose real and 
imaginary parts are i.i.d. mean-0 and variance-1 Gaussian entries. 

Proof. By using (17) and Lemma 3, we need to find dT ( Cj) and thus TJ {(y). Let = Quj 1 and Cl 2 = G& 2 - 
Then Gy = Sli + ?.f2 2 - Let a singular value decomposition of the rank-1? matrix Gy be 


Gy = USV*, with 17 = ©!+* 0 2 , y = H 1 +*H 2 , ( 20 ) 

where ©i, © 2 , Si,S 2 G R NxR and S G M. RxR , and U G C NxR and V G C NxR satishes U*U = V*V = I. 
Then, by direct calculation, 


© = 


0 ! 

©2 


©2 

0 ! 


G K 2JVx (2ii) ) 




G R 2Arx G- R ) 


( 21 ) 


satisfy © T © 


a' 1 S = I. Moreover, if we define SI = 


S^! 

CI 2 


— CI 2 
S4x 


then 


n = © 


s 



( 22 ) 


is a singular value decomposition of the real matrix Cl, and the singular values Cl are those of Gy, each 
repeated twice. Therefore, 

^M = IISy||* = ||S[l* = ^||*. (23) 


Define a linear operator £ : 


h-> R2ATX2JV by 


( 

Ot 

\ 

Ga 

-GP 


p 

)- 

G (3 

Ga 


, with ex, (3 G 


d 2 *V —1 


By (23) and the definition of S~2, we obtain = ^||fd>||». From convex analysis theory and Cl = £ui, the 
subdifferential of J- is given by 


dH{Co) = -£*s||n||,. 


On the one hand, the adjoint £* is given by, for any A = 


*-»n ^12 

A 21 A 2 2 


£*A 


G*( An + A22) 

G*(A 2i - A 12 ) 


(24) 

G M 2Nx2N with each block in M. NxN , 

(25) 


On the other hand, since (22) provides a singular value decomposition of Cl, 

0||fi||. = {©S T + A | 0 t A = 0 , A3 = 0 , || A || 2 < 1} . 
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(26) 


















Combining (24) (25) (26) and (21) yields the subdifferential of J- at 


dT{u>) = 


A11+A2 


g* (& 1 sJ + e 2 m 
g* (© 2 sf - + A2i ~ A 


A n 

A 21 


A12 
A 22 


0 t A 


0, A3 = 0, 


2 ^ 


< 1 


We are now ready for the estimation of the Gaussian width. Let the set 6 be a subset of the set of 
complex-valued vectors 


© = {g*{uv* + w)\u*w = o, wv = 0, \\w \\ 2 < 1}, 

where U , V are in (20). Then, it can be checked that 


(27) 




a + if3 £ © > c 


(28) 


Actually, for any W = + *A 2 satisfying U*W = 0, WV = 0 and ||W || 2 < 1, we choose A = 

Ai — A 2 

A 2 Ai 

(©iS'f + © 2 “^ + Aj) + *(©,3, 

With the help of (28), we get 


. Obviously, this choice of A satisfies the constraints on A in dT(u>). Furthermore, UV*+W = 
T 1 ©iST + A 2 ). Therefore, (28) holds. 


min J£- 7||2 =min min ||£ - A 7 H 2 < minmin ||£ - A-y|| 2 - (29) 

'y^u(y) A>0 'y£dJ r {u>) A>0 'yef) 


We then convert the real-valued vectors to complex-valued vectors by letting g = + *£ 2 and c = 71 + * 72 , 

where £1 and £ 2 are the first and second half of £ respectively and so for 71 and 72 . This leads to 


min ||£- 7 ll 2 < minmin ||£ - A 7|| 2 = minmin ||g - Ac|| 2 . 
T£~S(y) ^>0 T£-t> a>o ce© 


Since Q*Q is the identity operator and QQ* is an orthogonal projector, for any A > 0 and c £ S, 


ll<7 - Ac|| 2 = II Qg - \g c \\ F = \\Q g - xgg*(uv* + W) Ilf 

= (II Qg - X(UV * + W)\\jr - ||A(J - gg*)(UV* + W)f F ) 1/2 (30) 

< \\Gg-\(UV* + W)\\ F , 


where W satishes the conditions in the dehnition of © in (27). Dehne two orthogonal projectors Vi and V 2 
in C NxN by 


ViX = UU*X + XVV* - UU*XVV *, V 2 X = (J - UU*)X(I - VV*). 

Then, it can be easily checked that: V\X and V 2 X are orthogonal, X =T\X +V 2 X . and 

V-JJV* = UV*, V 2 W = 0, V{W = 0, V 2 W = W, (31) 

where U, V, W the same as those in (27). We choose 

X = \\V 2 (gg)h, W = jV 2 (Qg). 

Then, W satisfies constraints in (27). This, together with (29)(30)(31), implies 

min U-'yh<\\gg-\\P 2 (gg)hUV*-P 2 {Qg)\\ F =\\Pi(Qg)-\\P2(Q9)hUV*\\ F 
-ye ££(w) b 

< \\Vi{Gg)\\ F + \\v 2 (gg)\\ 2 \\uv*\\ F = wv^gg^F + VR\\v 2 (g g )\\ 2 . 
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We will estimate both \\Vi(Gg)\\ F and \\V 2 (Gg)\\ 2 - For \\Vi(Gg)\\F, we have 


\\Vi(Gg)\\ F = || UU*(Gg) + (Gg)VV* - UU*{Gg)VV*\\ F = \\UU*(Gg) + (7 - UU*)(Gg)VV*\\ F 
< \\UU*(Sg)\\ F + ||(7 - UU*){Gg)VV*\\ F < \\UU*(Gg)\\ F + \\(Gg)VV*\\ F 

<2VR\\Ggh 

where in the last line we have used the inequality 

\\UU*(Gg)\\ F < \\UU*\\ F \\Ggh < ^R\\Ggh 

and similarly \\(Gg)VV*\\ F < VR\\Ggh- For ||:P 2 (C?g)|| 2 , 

\\r 2 (Gg)h = 11(7 - UU*)(Gg){I - VV*)\\ 2 < ||7 - UU*\\ 2 \\Gg\\ 2 \\I - VV*|| 2 < \\Gg\\ 2 . 

Altogether, we obtain 

“in ||^-7||2 < 2 ,\/R\\Gg\\2, 

t 65:S(w) 

which together with (17) gives 


n §jf-3) < 3 Vr ■ E(\\Ggh). 


□ 


3.5 Bound of E(||£/</|| 2 ) 

The estimation of E(||t?^|| 2 ) plays an important role in proving Theorem 1 since it needed to give the tight 
bound of the Gaussian width zu(X®(y) D S^ -3 ). The following Theorem gives us a bound for E(||£g|| 2 ). 

Theorem 3. Let g 6 K 2iv_1 be a random vector whose entries are i.i.d. Gaussian random variables with 
mean 0 and variance 1, or g £ C 2iv_1 a random vector whose real part and imaginary part have i.i.d. 
Gaussian random entries with mean 0 and variance 1. Then, 

E(||Gfl|| 2 )<CilniV, 

where C\ are some positive universal constants. 

The proof of Theorem 3 is relatively complicated. In order to help the reader easily understand the proof, 
we begin with real case and introduce some ideas and lemmas first. Assume g € R 2iv_1 has i.i.d standard 
Gaussian entries with mean 0 and variance 1. Notice that Gg is symmetric. Therefore, for any even integer k , 
(tr {Gg) k Y^ k is the fc-norm of vector of singular values, which implies ||(?g|| 2 < (tr (C?g) fc ) 1/,fc . This together 
with Jensen’s inequality, 


E(||6rg|| 2 ) < E ((tr (Gg) k ) 1/k ) < (E(tr (Gg) k )) ^ . (32) 

Thus, in order to get an upper bound of E(||C?g|| 2 ), we estimate E (tr ((f/g/j j ■ Denote M = Gg. It is easy 
to see that 

E(tr(M fc ))= £ E(M ili 2 M l 2 i 3 ...M ik _ llk M lkil ). (33) 

0<ii 

Therefore, we only need to estimate E( M iii 2 M i 2 i 3 ■ ■ ■ Mj fcil ). 

To simplify the notation, we denote iu+i = ii- Notice that M tl = , where gi+j is a random 

V Ki+j 

Gaussian variable and Kj is defined in (8). Hence, Mi e ,i e+1 = if and only if ig_ + ii+\ = i F + i F+ \. 

In order to utilize this property, we would like to introduce a graph for any given index ii, i 2 ,..., ik and its 
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equivalent edges on the graph. More specifically, we construct graph with nodes to be *i, * 2 , ■ ■ ■, ik 

and edges to be (*i,* 2 ), (* 2 ,* 3 ), • • •, (*k-i,*k)> (h,h)- Let the weight for the edge (it,k+ 1 ) be it + it+\- The 
edges with the same weights are considered as an equivalent class. Obviously, Mi et i e+1 = Mi e ,.i e , + I if and 
only if (it, ie+i) and (if , it'+ 1 ) are in the same equivalent class. Assume there are p equivalent classes of the 
edges of -S’*! ,-» 3 ,...,* fc - These equivalent classes are indexed by 1,2, ... ,p according to their order in the graph 
traversal i\ —»• *2 — > ... — > ik —> ii- We associate the graph $i lt i 2 ,...,i k a sequence C 1 C 2 .. .Ck, where Cj is the 
index of the equivalent class of the edge (ij,ij+ 1 ). We call C 1 C 2 ... Ck the label for the equivalent classes of 
the graph ■ 

The label for the equivalent classes of the graph dii,i 2 ,...,i k plays an important role in bounding E( |||| 2 )■ 
In order to help the reader understand this concept better, we give two specific examples here. For N = 
6 , k = 6,ii = 1, *2 = 4, *3 = 1, i\ = 3,*5 = l,i$ = 4, we have a corresponding graph and its label for the 
equivalent classes of the graph is 112211. For N = 6 , k = 6 ,*i = 2 ,i 2 = 3, *3 = 2, 14 = 4,i 5 = 2, ie = 3, 
the label for the equivalent classes of the corresponding graph is 112211 as well. Therefore, there may be 
several different index sequences * 1*2 ... ik that correspond to the same label for the equivalent classes of the 
corresponding graph. Let 21 Cl c 2 —c k be the set of indices whose label of equivalent class of the corresponding 
graph is cic 2 ... c*,, i.e. 


%c 1 c 2 ...c k = {* 1*2 • ■ -ik\ the label for the equivalent class of the graph is C 1 C 2 ... Ck} (34) 

For given C 1 C 2 .. - Ck, 21 Cl c 2 ...c k is a subset of {ii *2 ■ • ■ ik\ij € {0,1, ..., N — 1}, V j = 1, ..., k}. The 
following lemma gives us an estimate for the bound Z)» 1 * 2 ...i fc esa ( . 1C2 Ck E (M ili 2 M i2i3 ... M ik _ lik M lkil ). 

Lemma 5. Let C be the Riemann zeta function and 2 i r ,m n. be defined in (34). Define B(s ) = In (N + 1) 
if s = 2 and B(s ) = C(s/2) < tt 2 /6 for s > 4. Then 

p 

E E(M ili 2 M i 2 i 3 ...M ik _ lik M ikil )<Nl[B(st)(st-l)\\ (35) 

ii i 2 ■ ■ -i/e c 2 ■ ■ ■ c k £ = 1 

where p is the number of equivalent classes shown in C 1 C 2 .. .Ck, and st, t = 1 ,... ,p, is the frequency of £ in 
C 1 C 2 ... Ck- 

Proof. We begin with finding free indices for any ii, ii, ■ ■ ■, ik in the set 21 Cl c 2 ...c k ■ Let (ji, J 2 ) be the first edge 
of the class 1. Therefore, the weight of the first class is j 1 + j 2 - For convenience, we define k\(j\) = j\. The 
first edge of the class 2 must have a vertex fc 2 (j l 'i, j‘i), depending on j\ and J 2 , and a free vertex, denoted by 
j 3 . The weight of the second class is ^(jq, Similarly, the first edge in class 3 has a vertex k 3 (j 1 , j 2 , j 3 ) 

and a free vertex j' 4 , and the weight is k 3 (ji , . 72 , j.i ) + J 4 , and so on. Finally, the first edge in class p has a 
vertex k p (j\,j 2 , ■ ■ ■ ,j P ) and a free vertex j p +i, and the weight is k p (ji,j 2 , ■ ■ ■ ,j p ) + j P + 1 - Recall that the 
entry My is , where gi+j is a random Gaussian variable. Therefore, for any * 1*2 .. .ik € 21 Cl c 2 ...c k , 

\ K i+j 

E (M lli 2 M l2i3 . ..M lk _ lik M lkil ) = f[ -l^E (g^) , (36) 

t=l R-rni 


where me = ke(j\, ;/ 2 , ■ • • ,jt) + je+i- Therefore, it is non-vanishing if and only if si, S 2 , ■ ■ ■, s p are all even. 
In these cases, 

P / -1 \ 11 

E(Mj 1 j 2 Mj 2 j 3 .. • Mj fe _ 1 j fc Mj fc j 1 ) = rfSe/2 ■ (37) 

e=i 


Summing (37) over 2t ClC2 ... Cfc , we obtain 


'y ] E(Mj 1 j 2 Mj 2 i 3 

i±l2 • • -ik C 2 • • - c fc 


N-1N-1 N—l N—l p 

■ ^i k _ 1 i k Mi k i 1 ) < EE-E E n 


ii=oj2=o tp=o j P +i=of=i 


(se-l)W 

tp- s £/^ 

Iy rri£ 


V V ( (si ~ iy - ! V l {S2 - 1)!! V 

* ^ f ^ I 7y" s i/2 t ^ I 7>^' s 2/2 / -j 

Jl=0.72=0 y fcl(jl)+j2 j3=0 \ k2(jl,j2)+j3 ^4=0 


N—l 

E 


(s P — 1 )!! 


K 


Sp /2 


j P + 1 —° k p 
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1 + 1/2 + 1/3 + ... 1/N < ln(N + 1 ) s = 2 , 

1 + 1 / 2 s / 2 + ... + 1/N S / 2 < ((s/2) s = 4, 6 ,... 


Since, for any 0 < c < N — 1, 


iV-l 


E 

t =0 


1 


rs/2 

1 X c+t 


< 


where £ is the Riemann zeta function. By defining B(s) = In (AT + 1) if s = 2 and B(s ) = ((s/2) < 7r 2 /6 for 
s > 4, the desired result easily follows. □ 


The desired bound for E(||t7fir|| 2 ) can be obtained if we know how many different sets of 2l ClC2 ... Cfe available 
in the set {i\i 2 ■ ■ ■ ik\ij 6 {0,1,..., N— 1}, V/ = 1,..., k}. Let 93 SlS2 ... Sp be the set of all labels of p equivalent 
classes with £-th class containing sg equivalent edges respectively, i.e. 

I C 1 C 2 ... Cp is a valid label of equivalent classes in graph dni2...i k 
C± C 2 '' 1 c pI and there are se £’s in the label C 1 C 2 ... c p 

Let be the set of all possible set of all possible choice of p positive even numbers si,...,s p satisfying 
si + S 2 + ■ ■ ■ + s p = k. Then 

E(tr(M fc )) = £ E(M ill2 M l2i3 ... M ik _ lik M ikil ) 

k/2 (39) 

^E E E .. E E(ALjii 2 Afj 2 i 3 ... Mi k _ 1 i k Mi k i 1 ) 

P —1 Sl...Sp£(£p Cl C 2 • • -C-k £ 23 s ^ S 2 ■ ■ ■ Sp ^1 ^2 • ■ £ 2lci C 2 ■ • ■ c^. 

By bounding the cardinality of fB slS2 ... Sp and C p , we can derive the bound E(tr(M fc )) hence E(||^g|| 2 ) for 
the real case. The complex case can be proved by directly using the results for the real case. Now, we are 
in position to prove Theorm 3. 




Proof of Theorem 3. Following (39), we need to count the cardinality of < B SlS2 ... Sp . For any C 1 C 2 ... c& G 
® Sl s 2 ...s p , we must have ci = 1. Therefore, there are choices of the positions of remaining l’s in 

C 1 C 2 ■ ■ ■ Cfe. Once positions for l’s are fixed, the position of the first 2 has to be the first available slot, we 
have choices for the positions of remaining 2’s, and so on. Thus, 




fk— 1 \ fk — s 1 — 1 \ (k — Si — ... — s p _i — 1 \ 

\S 1 - l) ' V *2 - 1 ) ' " ' V S p - 1 ) 

(k — 1 )(k — 2 )... (k — si + 1 ) (k — si — 1 )... (k — si — S 2 + 1 ) 

(si - 1)! (s 2 - 1)! 

_(fc ~ 1 )!_ 

n?=i(^ - i)i -«!-■■•-«/)’ 


which together with (35) implies, for any S 1 S 2 ■ • ■ s p G £ p , 


E .. E £-(Mi!i 2 Mi 2 i 3 ... Mi k _ 1 i k Mi k i 1 ) 

Cl C 2 ■ ■ - Cfc G 23 siS2-..Sp c<2 • ■ - Cfc 

<N -tE- TT B (s t ) 

" ULM - 2)!! UVi(k - si -...-at) 


Summing (40) over <£ p yields 

E. E .. E E(Miii 2 M i2i3 ... Mi k _ 1 i k Mi k j /1 ) 

Si ...SpG^p CiC 2 • • -Cfc G23 si S2 • • • Sp M ^2 • •^^ci C2 • • • c^. 

nti B( St ) 


<N(k — 1)! Y, 


ai .... Bee „ nLi(s.-2)!!ns(fc-s!-...-^) 


(40) 


(41) 
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Let us estimate the sum in the last line. Let s be the number of 2’s in S 1 S 2 ... s p . Then, 

f[B(s e ) <ln s {N + 1) fyV . (42) 

l =l ' ' 

Since each si,..., s p > 2 and there are p—s terms greater than 4 among them, we have 

p 

HO* - 2)!! > 2P~ S (43) 

i =i 


and k — Si — ... — St = se +i + ... + s p > 2 (p — £), which implies 


p —1 p —1 

J(fc - si - ... - se) > 2 (P-^) = 2 P-I (p — !)!■ 


(44) 


c=i 


£=i 


There are ( p ) choices of the positions of the s 2’s. Moreover, once the s 2’s in S 1 S 2 ... s p are chosen, there 
are at most 


k 


k 




choices of the remaining p — s s,’s. Altogether, 


E E E E(Af i ij 2 M i 2 i 3 .. ■ Mi k _ 1 i k Adi k i 1 ) 

Si-..Sp£(£p Ci C2 • •-C/j. G ® S2 • • • Sp ^1^2 ■ • C2 • •-c^. 


<N(k- 1)\J2 


s =0 


p\ f k 


sj V 2 

p 


p—s 


\n{N + l) 


2 \ P~ s 


1 




s=0 

2AT(fc- 1)! A fp\ U 2 k\ p ~ s /ln(JV + l)Y 

(P-1)! V 2 J 


p—s 


6 J 2P- s 2P~ 1 (p-l)\ 

2 \P~ S 2 


ln s (iV + 1) 


6 J 4P~ S 2 S 


(45) 


s=0 


=2N(k — 1)! 


ln(iV+l) 


)’ 


ip~ !)! 

Finally, (45) is summed over all possible p and we obtain 
E(tr(M fc ))= ^ E (M ili2 M i2i3 ...M ik _ lik M ikil ) 

i \,12 
k/2 

^E E E E 

P 1 Si...Sp£(£p C 1 C 2 • •-Cfc G® si S2 • •-Sp ^1 ^2 • • •‘^k £ 2lci c<2 • ■ • 


E(M ili2 M i2 i 3 ... M ik _ 1 i k M ikil ) 


(46) 


< 2N(k- l)l53 


k/2 ( n 2 j, , ln(JV+l) \ 


P = 1 


(P — 1)! 


By using the fact that, for any A > 0, 


k/2 


AP 


p =1 


A 2 


E(^7i)j -a(i + a + — + ... 


A k / 2-1 
(k/2 — 1)! 


< Ae / 
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(46) is rearranged into 


E(tr(M fe )) < 2 N{k - 1)! (—k + ln ^ + ^ e £k+ l ^P- = 2 Ny/W+l{k - 1)! ( — k + + 1 A e & 

\48 2 J \48 2 J 

< 2{N + l)5fc fc ( (- + 


3, t /'f 2 ln(7V + l)\ wl 


e 48 

48 ' 2 k 

Let k be the smallest even integer greater than ln(7V + 1). Then using ||M || 2 < (tr(M fc )) 1 / fc lead to 

Jl |„/AF i 1\\ 1 / k 


E(||M|| 2 ) < E((tr(M fc )) 1 /fe) < (E(tr(M fc ))) 1/fe < (2(IV + 1 )^ k k \ + 1} 


e 48 


24 


< 224 in(jv+i) . e is . _ In (AT + 1) • — 

7 r 2 V 24 


/tj- 2 \ 241n(W + l) ^2 


• e 48 < C\ In N , 


where the constant C\ is some universal constant. 

Next, we estimate the complex case. In this case, g € C 2 ^” 1 , where both its real part and imaginary 
part have i.i.d. Gaussian entries. Write g = £ + irj, where $,,g £ R 2 ^ 1 are real-valued random Gaussian 
vectors. From the real-valued case above, we derive 


Therefore, 


E(||^|| 2 ) < Ci InIV, E(||£fy|| 2 ) < Ci In N. 

EtWGgh) = E(||6?€ + tSvh) < E(» 2 ) + E(||^|| 2 ) < 2C X In N. 


□ 


3.6 Proof of Theorem 1 


With Lemmas 1, 2, 4, and Theorem 3 in hand, we are in position to prove Theorem 1. 


Proof of Theorem 1. Since (10) is equivalent to (4) by the relation y = Vx, we only need to prove that 
y = y for noise free data ( \\y — y|| 2 < 2<5/e for noisy data) with dominant probability. According to Lemma 
1, we only need to prove (13). By Lemma 2, 


P 


( min 

ze%(y) nS 2 c N ~ 2 


\\Bz\\ 2 > 


^_2e 2 


Lemma 4, Theorem 3, and the inequality A m > y/M+i imply that 

Am - MT R (y) n S^" 3 ) - -L > ^== - ZCiVRlnN- -L > VW^l - 3(7^In IV - -^=. 

When M > (6Ci\/l?ln IV + \[2e) 1 + 1, we can easily get P (min ze3; ^ n g 2 Ar -2 ||£> 2 || 2 > e'j > 1 — 2e M » 1 . We 
get the desired result. □ 


4 Extension to Structured Low-Rank Matrix Reconstruction 

In this section, we extend our results to low-rank Hankel matrix reconstruction and low-rank Toeplitz matrix 
reconstruction from their Gaussian measurements. 

Since the proof of Theorem 1 does not use the specific property that y is an exponential signal, Theorem 
1 holds true for any low-rank Hankel matrices. We have the following corollary, which reads that any Hankel 
matrix of size N x N and rank R can be recovered exactly from its 0(R In 2 N ) Gaussian measurements, and 
this reconstruction is robust to noise. 
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Corollary 1 (Low-Rank Hankel Matrix Reconstruction). Let H £ c JVxAr be a given Hankel matrix with 
rank R. Let x £ C 2 ^ 1 be satisfying Xi+j = Hij for 0 < i, j < N — 1. Let A = BV £ C Mx ( 2Ar ~ 1 ) ) where 
B £ C Mx ( 2Ar -D is a random matrix whose real and imaginary parts are i.i.d. Gaussian with mean 0 and 
variance 1, T> £ R( 2Ar_1 ) x ( 2JV_1 ) is the same as defined in Theorem 1. Then, there exists a universal constant 
Ci > 0 such that, for any e > 0, if 

M > {CiVRlnN + y/2 e) 2 + 1, 

M — 1 

then, with probability at least 1 — 2e s - , we have 
(a) H ( x ) = H, where x is the unique solution of 

min ||i?(a;)||* subject to Ax = b 

X 


with b = Ax; 

(b) \\H(x)-H)\\ f < 2 5/e, where x is the unique solution of 

min || H(x)\\* subject to \\Ax — 6 ||2 < <5 

X 


with || b — Ax ||2 < 5. 

Moreover, Theorem 1 can be extended to the reconstruction of low-rank Toeplitz matrix from its Gaussian 
measurements. Let T £ C JVxN be a Toeplitz matrix. Let x £ C 2Ar_1 be a vector satisfying Xn-i+U- j) = Ti,j 
for 0 < i, j < N — 1. Let P £ C NxN be an anti-diagonal matrix with anti-diagonals of 1. Then, it is easy 
to check that T = H(x)P. Thus, we define a linear operator T that maps a vector in C 2 ^ 1 to a N x N 
Toeplitz matrix by T{x) = H(x)P. Since P is a unitary matrix, one has ||T(a:)||* = \\H(x)P\\* = ||iT(a;)||*. 
Therefore, the above corollary can be adapted to low-rank Toeplitz matrices. We obtain the following 
corollary, which states that any Toeplitz matrix of size N x N and rank R can be recovered exactly from its 
0(R In 2 N ) Gaussian measurements, and this reconstruction is robust to noise. 

Corollary 2 (Low-Rank Toeplitz Matrix Reconstruction). Let T £ <C NxN be a given Toeplitz matrix with 
rank R. Let x £ C 2Ar_1 be the vector satisfying x^r-i+^-j) = Ti,j for 0 < i,j < N — 1. Let A = BV £ 
£Mx( 2 N-i)* where B £ C Mx ( 2Ar ^ 1 ) a random matrix whose real and imaginary parts are i.i.d. Gaussian 
with mean 0 and variance 1, V £ K( 2W_1 ) x ( 2Ar_1 ) is the same as defined in Theorem 1. Then, there exists 
a universal constant C\ > 0 such that, for any e > 0 , if 

M > (C 1 VR.\nN+ V2e) 2 + 1, 

M —1 

then, with probability at least 1 — 2e ~, we have 
(a) T{x) = T, where x is the unique solution of 

min ||T(a;)||* subject to Ax = b 

X 


with b = Ax; 

(b) ||T(*) — T)|| f < 2 5/e, where x is the unique solution of 

min ||T’(a:)||* subject to ||Ax — fo ||2 < 5 

X 


with ||fo — Ax || 2 < S. 
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(a) Our method: Hankel nuclear norm (b) Off-the-grid CS [19]: Atomic norm (c) EMaC [10]: Hankel nuclear norm 
minimization with random Gaussian pro- minimization with non-uniform sam- minimization with non-uniform sam- 
jections. pling of entries. pling of entries. 


Figure 1: Numerical Results. 


5 Numerical Experiments 

In this section, we use numerical experiments to demonstrate our result and its performance improvement, 
compared with the results in [10,19]. In the numerical experiments, we use superpositions of complex 
sinusoids as test signals. Note that the application of our result is not limited to such signals but any signals 
that are superpositions of complex exponentials. 

The true signal x is generated as follows. We choose N = 64, i.e., the dimension of x is 127. The 
frequencies ft, k = 1,... ,R, are uniformly randomly drawn from the interval [0,1]. The arguments of the 
coefficients Ck, k = 1 ,..., R, are from the interval [0,2-7 t] uniformly at random, and their amplitudes are 
generated by |cj| = 1 + 10°- 5mi where m, follows the uniform distribution on [0,1]. Then, we synthesize the 
true signal x by Xt = EtLi Cke l27T ^ kt for t = 0,1,..., 126. For each fixed M and R, we test 100 runs. We 
plot in Fig. 1(a) the rate of successful reconstruction by (4), which is solved by alternating direction method 
of multipliers (ADMM). We see from Fig. 1(a) that the phase transition of our method is very sharp. 

For comparison, we plot the phase transitions of off-the-grid CS [19] and EMaC [10] in Fig. 1(b) and 
Fig. 1(c) respectively. These figures are from [10] under the same setting as ours. We observe that, for the 
same R , our method generally needs smaller M than off-the-grid CS and EMaC to achieve a high successful 
reconstruction rate. This illustrates that empirically our method requires fewer measurements than both 
off-the-grid CS and EMaC for the exact reconstruction of complex sinusoid signals. Finally and importantly, 
our method does not need a separation condition of frequencies to guarantee a successful recovery. 
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